Step-function selection models for meta-analysis

effect size
distribution theory
selective reporting
Author

James E. Pustejovsky

Published

July 9, 2024

In a recent post I looked at the distribution of statistically significant effect sizes in studies that report multiple outcomes, when those studies are subject to a certain form of selective reporting. I considered a model where each effect size within the study is more likely to be reported when it is affirmative—or statistically significant and in the hypothesized direction—than when it is non-affirmative. Because studies with multiple effect sizes are a very common occurrence in social science meta-analysis, it’s interesting to think about how this form of selection leads to distortions of the results that are actually reported and available for meta-analysis.

In this post, I want to look a different scenario that is simpler in one respect but more complicated in another. Simpler, in that I’m going to ignore dependence issues and just think about the distribution of one effect size at a time. More complicated, in that I’m going to look at a more general model for how selective reporting occurs, which I’ll call the step-function selection model.

The step-function selection model

The step-function selection model was introduced by Hedges () and has been further expanded, tweaked, and studied in a bunch of subsequent work. The model has two components: a set of assumptions about how effect size estimates are generated prior to selection (the evidence-generation process), and a set of assumptions about how selective reporting happens as a function of the effect size estimates (the selective reporting process).

In the original formulation, the evidence-generation process is a random effects model. Letting Ti denote an effect size estimate prior to selective reporting and σi denote its (known) standard error, we assume that (1)Ti|σiN(μ,τ2+(σi)2), just as in the conventional random effects model. Here μ is the overall average effect size and τ is the standard deviation of the effect size parameter distribution.

For the second component, we assume that the selective-reporting process is fully determined by the statistical significance and sign of the effect size estimates. We can therefore formalize the selective reporting process in terms of the one-sided p-values of the effect size estimates. Assuming that the degrees of freedom are large enough to not worry about, the one-sided p-value for effect size i is a transformation of the effect size and its standard error: (2)pi=1Φ(Ti/σi) In the step-function model, we assume that the probability that an effect size estimate is reported (and thus available for meta-analysis) depends on where this one-sided p-value lies relative to a pre-specified set of significance thresholds, α1,...,αH. These thresholds define a set of intervals, each of which can have a different probability of selection. Let Oi be a binary indicator equal to 1 if Ti is reported and otherwise equal to zero. The selective reporting process is then (3)Pr(Oi=1|Ti,σi)={1ifpi<α1λ1ifαhpi<αh+1,h=1,...,H1λHifαHpi. Note that the selection probability for the lowest interval [0,α1) is fixed to 1 because we can’t estimate the absolute probability that an effect size estimate is reported. The remaining parameters of the selection process therefore each represent a ratio of the probability of reporting an effect size estimate falling in a given interval to the probability of reporting an effect size estimate falling in the lowest interval.

In practice, the analyst will need to specify the thresholds of the significance intervals in order to estimate the model. One common choice is to use only a single threshold at α1=.025, which corresponds to a two-sided level of .05—Fisher’s vaunted criteria for when a result should be considered significant. This is the so-called “three-parameter” selection model, where the parameters of interest are the average effect size μ, the heterogeneity SD τ, and the relative selection probability λ1. Other possible choices for thresholds might be:

  • A single threshold at α1=.50, so that negatively-signed effect size estimates have a different selection probability than positively-signed estimates;
  • A two-threshold model with α1=.025 and α2=.50 (I like to call this a four-parameter selection model); or
  • A model with thresholds for significant, positive results at α1=.025, for the sign of the estimate at α2=.50, and for statistically significant results in the opposite of the expected direction at α3=.975.

Many other choices are possible, of course.

Distribution of observed effect sizes

Equations () and () are sufficient to describe the distribution of effect sizes actually observed after selection. If we let Ti and σi denote effect size estimates that are actually observed, then the distribution of (Ti|σi) is the same as that of (Ti|σi,Oi=1). By Bayes Theorem, (4)Pr(T=t|σi)=Pr(Oi=1|Ti=t,σi=σi)×Pr(Ti=t|σi=σi)Pr(Oi=1|σi=σi) For the specific distributional assumptions of the step-function selection model, we can find an expression for the exact form of (). In doing so, it will be useful to define a further random variable—call it Si—that is equal to the p-value interval into which effect size Ti falls. For a given effect size with standard error σi, these intervals are equivalent to intervals on the scale of the outcome, with thresholds γhi=σiΦ1(1αh). Now, let’s define Si as Si={0ifγ1i<Tihifγh+1,i<Tiγhi,h=1,...,H1HifTiγHi and Si as the corresponding interval for the observed effect size Ti. Note that () is equivalent to writing the relative selection probabilities as a function of Si: (5)w(Ti,σi)=λSi Also note that, prior to selection, the effect size estimate Ti has marginal variance ηi2=τ2+(σi)2, so we can write Pr(Ti=t|σi)=1ηiϕ(tμηi), where ϕ() is the standard normal density. We can then write the distribution of the observed effect size estimates as (6)Pr(Ti=t|σi)=w(t,σi)×1ηiϕ(tμηi)Ai, where (7)Ai=w(t,σi)×1ηiϕ(tμηi)dt=h=0HλhBhi, with Bhi=Φ(γhiμηi)Φ(γh+1,iμηi) and where we take λ0=1, α0=0, and αH+1=1 (). Note that Ai=Pr(O=1|σi), the probability that an effect size estimate with precision σi will be observed.

The observed effect size estimates follow what we might call a “piece-wise normal” distribution. For a given σi and given the interval Si into which the effect size falls, the effect size follows a truncated normal distribution. Formally, (Ti|Si=h,σi)TN(μ,ηi2,γh+1,i,γhi). Furthermore, the distribution of Si is given by Pr(Si=h|σi)=λhBhig=0HλgBgi, which will be useful for deriving moments of the distribution of Ti.

Here is an interactive graph showing the distribution of the effects prior to selection (in grey) and the distribution of observed effect sizes (in blue) based on a four-parameter selection model with selection thresholds of α1=.025 and α2=.50. Initially, the selection parameters are set to λ1=0.6 and λ2=0.3, but you can change these however you like.

math = Object {isNumber: ƒ(e), isComplex: ƒ(e), isBigNumber: ƒ(e), isBigInt: ƒ(e), isFraction: ƒ(e), isUnit: ƒ(e), isString: ƒ(e), isArray: ƒ(), isMatrix: ƒ(e), isCollection: ƒ(e), isDenseMatrix: ƒ(e), isSparseMatrix: ƒ(e), isRange: ƒ(e), isIndex: ƒ(e), isBoolean: ƒ(e), isResultSet: ƒ(e), isHelp: ƒ(e), isFunction: ƒ(e), isDate: ƒ(e), isRegExp: ƒ(e), …}
norm = Module {Z: ƒ(…), cdf: ƒ(z), icdf: ƒ(…), intE: ƒ(a, b), pdf: ƒ(z), Symbol(Symbol.toStringTag): "Module"}
eta = 0.223606797749979
H = 2
alpha = Array(2) [0.025, 0.5]
lambda = Array(3) [1, 0.6, 0.3]
lambda_max = 1
findlambda = ƒ(p, alp, lam)
findMoments = ƒ(mu, tau, sigma, alp, lam)
moments = Object {Ai: 0.5804803322995085, ET: 0.22103888373928482, SDT: 0.21843435492636226}
Ai_toprint = "0.580"
ET_toprint = "0.221"
eta_toprint = "0.224"
SDT_toprint = "0.218"
pts = 201
dat = Array(201) [Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, …]
0.00.20.40.60.81.01.21.41.6↑ Density−0.4−0.20.00.20.40.60.8Effect size estimate (Ti) →
μ=0.15ηi=0.224E(Ti)=0.221V(Ti)=0.218Pr(Oi=1)=0.580 \begin{aligned} \mu &= 0.15 & \qquad \eta_i &= 0.224 \\ \mathbb{E}\left(T_i\right) &= 0.221 & \qquad \sqrt{\mathbb{V}\left(T_i\right)} &= 0.218 \\ \Pr(O_i^* = 1) &= 0.580 \end{aligned}
mu = 0.15
tau = 0.1
sigma = 0.2
lambda1 = 0.6
lambda2 = 0.3

Moments of Ti|σi

Using the auxiliary random variable Si makes it pretty straight-forward to find the moments of Ti|σi. Let me denote ψhi=ϕ(ch+1,i)ϕ(chi)Bhi, where chi=(γhiμ)/ηi for h=0,...,H. Then from the properties of truncated normal distributions, E(Ti|Si=h,σi)=μ+ηi×ψhi, It follows immediately that (8)E(Ti|σi)=h=0HPr(Si=h|σi)×E(Ti|Si=h,σi)=μ+ηih=0HλhBhiψhih=0HλhBhi. The second term of () is the bias of Ti relative to the overall mean effect μ. Generally, it will depend on all the parameters of the evidence-generating process, including σi, μ, and τ (through the chi and Bhi terms) and on the selection weights λ1,...,λH.

Just for giggles, let me chug through and get the variance of Ti as well. Letting ψ¯i=h=0HλhBhiψhih=0HλhBhi and κhi=chiϕ(chi)ch+1,iϕ(ch+1,i)Bhi, we can write the variance of the truncated normal conditional distribution V(Ti|Si=h,σi)=ηi2(1κhiψhi2). Using variance decomposition, we then have (9)V(Ti|σi)=E[V(Ti|Si,σi)]+V[E(Ti|Si,σi)]=E[V(Ti|Si,σi)+[E(Ti|Si,σi)E(Ti|σi)]2]=1h=0HλhBhih=0HλhBhi(ηi2(1κhiψhi2)+ηi2[ψhiψ¯i]2)=ηi2(1κ¯iψ¯i2), where κ¯i=h=0HλhBhiκhih=0HλhBhi Just as with the expectation, the variance is a complicated function of all the model parameters.

Funnel density

The graph above shows the distribution of observed effect size estimates with a given sampling standard error σi. In practice, meta-analysis datasets include many effect sizes with a range of different standard errors. Funnel plots are a commonly used graphical representation the distribution of effects in a meta-analysis. They are simply scatterplots, showing effect size estimates on the horizontal axis and standard errors (or some measure of precision) on the vertical axis. Usually, they are arranged so that effects from larger studies appear closer to the top of the plot. Funnel plots are often used a diagnostic for selective reporting because they will tend to be asymmetric when non-affirmative effects are less likely to be reported than affirmative effects.

I think it’s pretty useful to use the layout of a funnel plot to understand how meta-analytic models work. A basic random effects model implies a certain distribution of population effects, which can be represented by the density of points in a funnel plot. That density will have a shape kind of like an upside down funnel: narrow near the top (where studies are large and σi is small), getting wider and wider as σi increases (i.e., as studies get smaller and smaller).

Here’s an illustration of this density, using darker color to indicate areas of the plot where effect sizes are more likely. I’ve used μ=0.15 (represented by the vertical red line) and τ=0.10 to calculate the density. The vertical gray line corresponds to μ=0, which is also the threshold where effect size estimates will have p-values of α=.50. The sloped gray line corresponds to the treshold where effect size estimates have p-values of α=.025; to the right of this line, effects will be statistically significant and affirmative; to the left, effects are non-affirmative.

The above graph shows the (conditional) distribution of effect size estimates under the random effects model, without any selective reporting. Selective reporting of study results will distort this distribution, shrinking the density for effects that are not affirmative.

Here is an interactive funnel plot showing the distribution of effect size estimates under a four-parameter selection model. Just as in the interactive graph above, I use fixed selection thresholds of α1=.025 and α2=.50. Initially, I set μ=0.15 and τ=0.10 and selection parameters of λ1=0.6 and λ2=0.3, but you can change these however you like.

SE_pts = 100
t_pts = 181
lambda_f = Array(3) [1, 0.6, 0.3]
alpha_f = Array(2) [0.025, 0.5]
sigma_max = 0.5
eta_max_f = 0.5099019513592785
funnel_dat = Array(18100) [Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, Object, …]
sigline_dat = Array(2) [Object, Object]
0.500.450.400.350.300.250.200.150.100.050.00↓ Standard error (sigma_i)−1.0−0.50.00.51.01.5Effect size estimate (Ti) →
0.00.20.40.60.81.0Density
mu_f = 0.15
tau_f = 0.1
lambda1_f = 0.6
lambda2_f = 0.3

If you fiddle with the selection parameters, you will see that the density of certain areas of the plot changes. For instance, lowering λ2 will reduce the density of negative effect size estimates; lowering λ1 will reduce the density of positive but non-affirmative effect size estimates, which fall between the vertical axis and the diagonal line corresponding to α=.025.

Comment

In this post, I’ve given expressions for the density of effect size estimates under the step-function selection model, as well as expressions for the mean and variance of effect size estimates of a given precision (i.e., for Ti|σi). Although these expressions are pretty complex, it seems like they could be useful for studying the properties of different estimators that have been proposed for dealing with selective reporting, such as the “unrestricted weighted least squares” method, which is just the idea of using fixed effects weights even though the effects are heterogeneous (; ); the PET and PEESE estimators (); the endogenous kink meta-regression (); and perhaps other estimators in the literature. Graphical depictions of the step function density (as in the funnel plot above) also seem potentially useful for understanding the properties of these estimators.

Back to top

References

Bom, P. R. D., & Rachinger, H. (2019). A kinked meta-regression model for publication bias correction. Research Synthesis Methods, 10(4), 497–514. https://doi.org/10.1002/jrsm.1352
Hedges, L. V. (1992). Modeling publication selection effects in meta-analysis. Statistical Science, 7(2), 246–255.
Hedges, L. V., & Vevea, J. L. (2005). Selection method approaches. In Publication bias in meta-analysis: Prevention, assessment, and adjustments (pp. 145–174). John Wiley & Sons.
Henmi, M., & Copas, J. B. (2010). Confidence intervals for random effects meta-analysis and robustness to publication bias. Statistics in Medicine, 29(29), 2969–2983.
Stanley, T. D. (2008). Meta-regression methods for detecting and estimating empirical effects in the presence of publication selection*. Oxford Bulletin of Economics and Statistics, 70(1), 103–127. https://doi.org/10.1111/j.1468-0084.2007.00487.x
Stanley, T. D., & Doucouliagos, H. (2014). Meta-regression approximations to reduce publication selection bias. Research Synthesis Methods, 5(1), 60–78.

Footnotes

  1. To be precise, the density will depend on the marginal distribution of σi’s in the population of effects. I’m going to side-step this problem by using the funnel plot layout to show the conditional distribution of the effect size estimates, given σi=σi.↩︎